Fractal and Transfractal Scale-Free Networks 






Hernan D. Rozenfeld, Lazaros K. Gallos, Chaoming Song, and Hernan A. Makse

Levich Institute and Physics Department, City College of New York, New York, New York 10031, USA



Article Outline 



Glossary and Notation 

1. Definition of the Subject and Its Importance 

2. Introduction 




3. Fractality in Real-World Networks



4. Models: Deterministic Fractal and Transfractal Networks 

5. Properties of Fractal and Transfractal Networks 

6. Future Directions 

7. APPENDIX: The Box Covering Algorithms 



8. Bibliography



Glossary and Notation


Degree of a node: Number of edges incident to the node.

Scale-Free Network: Network that exhibits a wide (usually power-law) distribution of the degrees.

Small-World Network: Network for which the diameter increases logarithmically with the number of nodes.

Distance: The length (measured in number of links) of the shortest path between two nodes.

Box: Group of nodes. In a connected box there exists a path within the box between any pair of nodes.
Otherwise, the box is disconnected.

Box Diameter: The longest distance in a box.



DEFINITION OF THE SUBJECT AND ITS IMPORTANCE

The explosion in the study of complex networks during the last decade has offered a unique view into the structure
and behavior of a wide range of systems spanning many different disciplines. The importance of complex networks
lies mainly in their simplicity, since they can represent practically any system with interactions in a unified way by
stripping away complicated details and retaining the main features of the system. The resulting networks include only
nodes, representing the interacting agents, and links, representing interactions. The term 'interactions' is used loosely
to describe any mechanism that causes two nodes to form a link. Examples include real physical links, such as the
wires connecting computers in the Internet or roads connecting cities, and virtual links, such as links between WWW
homepages or acquaintances in societies, where no physical medium actually connects the nodes.

The field was pioneered by the famous mathematician P. Erdős many decades ago, when he greatly advanced graph
theory [2]. The theory of networks would perhaps have remained a problem of mathematical beauty, had it not been for
the discovery that a huge number of everyday-life systems share many common features and can thus be described
through a unified theory. The remarkable diversity of these systems includes artificial or man-made technological
networks such as the Internet and the World Wide Web (WWW), social networks such as social acquaintances or
sexual contacts, biological networks of natural origin, such as the network of protein interactions of yeast, and a
rich variety of other systems, such as the proximity of words in literature, items that are bought by the same people,
or the way modules are connected to create a piece of software, among many others.

The advances in our understanding of networks, combined with the increasing availability of many databases,
allow us to analyze and gain deeper insight into the main characteristics of these complex systems. A large number
of complex networks share the scale-free property [3], indicating the presence of a few highly connected nodes (usually
called hubs) and a large number of nodes with small degree. This feature alone has a great impact on the analysis
of complex networks and has introduced a new way of understanding these systems. It carries important
implications for many everyday-life problems, such as the way a disease spreads in communities of individuals, or the
resilience and tolerance of networks under random and intentional attacks.

Although the scale-free property is of undisputed importance, it has been shown not to completely determine
the global structure of networks [13]. In fact, two networks that obey the same degree distribution may
differ dramatically in other fundamental structural properties, such as the correlations between degrees or the
average distance between nodes. Another fundamental property, which is the focus of this article, is the presence
of self-similarity or fractality. In simpler terms, we want to know whether a subsection of the network looks much
the same as the whole. Although in regular fractal objects there is no distinction between
self-similarity and fractality, in network theory we can distinguish the two terms: in a fractal network
the number of boxes of a given size that are needed to completely cover the network scales with the box size as a
power law, while a self-similar network is defined as a network whose degree distribution remains invariant under
renormalization of the network (details on the renormalization process will be provided later). This essential result
allows us to better understand the origin of important structural properties of networks such as the power-law degree
distribution.



INTRODUCTION



Self-similarity is a property of fractal structures, a concept introduced by Mandelbrot and one of the fundamental
mathematical results of the 20th century [15, 20]. The importance of fractal geometry stems from the fact that
these structures have been recognized in numerous examples in Nature, from the coexistence of liquid and gas at the
critical point of evaporation of water [21, 22, 23], to snowflakes, to the tortuous coastline of the Norwegian fjords,
to the behavior of many complex systems such as economic data or the complex patterns of human agglomeration.

Typically, real-world scale-free networks exhibit the small-world property [1], which implies that the number of
nodes increases exponentially with the diameter of the network, rather than following the power-law behavior expected
for self-similar structures. For this reason, complex networks were believed not to be length-scale invariant or
self-similar.

In 2005, C. Song, S. Havlin and H. Makse presented an approach to analyzing complex networks that reveals their
self-similarity [17]. This result is achieved by applying a renormalization procedure which coarse-grains the
system into boxes containing nodes within a given size [17, 24]. As a result, a power-law relation between the number
of boxes needed to cover the network and the size of the box is found, defining a finite self-similar exponent. These
fundamental properties, which are shown for the WWW, cellular and protein-protein interaction networks, help to
explain the emergence of the scale-free property in complex networks. They suggest a common self-organization
dynamics of diverse networks at different scales into a critical state, and in turn bring together previously unrelated
fields: the statistical physics of complex networks with the renormalization group, fractals and critical phenomena.



FRACTALITY IN REAL-WORLD NETWORKS



The study of real complex networks has revealed that many of them share some fundamental common properties.
Of great importance is the form of the degree distribution for these networks, which is unexpectedly wide. This means
that the degree of a node may assume values that span many decades. Thus, although the majority of nodes have
a relatively small degree, there is a finite probability that a few nodes will have a degree of the order of thousands or
even millions. Networks that exhibit such a wide distribution $P(k)$ are known as scale-free networks, where the term
refers to the absence of a characteristic scale in the degree $k$. This distribution very often obeys a power-law form
with a degree exponent $\gamma$, usually in the range $2 < \gamma < 4$,

$$P(k) \sim k^{-\gamma}. \tag{1}$$

A more generic property, which is usually inherent in scale-free networks but applies equally well to other types of
networks, such as Erdős-Rényi random graphs, is the small-world feature. Originally discovered in sociological
studies [26], it is the generalization of the famous 'six degrees of separation' and refers to the very small network
diameter. Indeed, in small-world networks a very small number of steps is required to reach a given node starting
from any other node. Mathematically this is expressed by the slow (logarithmic) increase of the average diameter of
the network, $\bar{\ell}$, with the total number of nodes $N$, $\bar{\ell} \sim \ln N$, where $\ell$ is the shortest
distance between two nodes and defines the distance metric in complex networks, namely,

$$N \sim e^{\bar{\ell}/\ell_0}, \tag{2}$$

where $\ell_0$ is a characteristic length.

These network characteristics have been shown to apply in many empirical studies of diverse systems [25]. The
simple knowledge that a network has the scale-free and/or small-world property already enables us to qualitatively
recognize many of its basic properties. However, structures that have the same degree exponents may still differ
in other aspects [12]. For example, a question of fundamental importance is whether scale-free networks are also
self-similar or fractal. The illustrations of scale-free networks (see e.g. Figs. 1 and 2b) seem to resemble traditional
fractal objects. Despite this similarity, Eq. (2) definitely appears to contradict a basic property of fractality: the fast
(power-law) increase of the diameter with the system size. Moreover, a fractal object should be self-similar or
invariant under a scale transformation, which is again not clear in the case of scale-free networks, where the scale
necessarily has a limited range. So, how is it even possible that fractal scale-free networks exist? In the following,
we will see how these seemingly contradictory aspects can be reconciled.



Fractality and Self-Similarity 

The classical theory of self-similarity requires a power-law relation between the number of nodes $N$ and the diameter
$\ell$ of a fractal object [16]. The fractal dimension can be calculated using either box-counting or cluster-growing
techniques. In the first method the network is covered with $N_B$ boxes of linear size $\ell_B$. The fractal dimension
or box dimension $d_B$ is then given by [15]:

$$N_B \sim \ell_B^{-d_B}. \tag{3}$$

In the second method, instead of covering the network with boxes, a random seed node is chosen and a cluster is grown
around it, containing all nodes separated from the seed by a maximum distance $\ell$. The procedure is then repeated
by choosing many seed nodes at random, and the average "mass" of the resulting clusters, $\langle M_c \rangle$ (defined
as the number of nodes in the cluster), is calculated as a function of $\ell$ to obtain the following scaling:

$$\langle M_c \rangle \sim \ell^{d_f}, \tag{4}$$

defining the fractal cluster dimension $d_f$ [15]. If we use Eq. (4) for a small-world network, then Eq. (2) readily
implies that $d_f = \infty$. In other words, these networks cannot be characterized by a finite fractal dimension, and
should be regarded as infinite-dimensional objects. If this were true, though, local properties in a part of the network
would not be able to represent the whole system. Still, it is also well established that the scale-free nature is similar
in different parts of the network. Moreover, a graphical representation of real-world networks allows us to see that
those systems seem to be built by attaching (following some rule) copies of themselves.
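
The cluster-growing measurement described above can be sketched in a few lines. This is only an illustrative sketch, not the authors' implementation: the adjacency-dictionary representation and the function names `bfs_distances` and `average_cluster_mass` are assumptions made for the example.

```python
from collections import deque

def bfs_distances(adj, source):
    """Shortest-path distances (in links) from source to every reachable node."""
    dist = {source: 0}
    queue = deque([source])
    while queue:
        node = queue.popleft()
        for nbr in adj[node]:
            if nbr not in dist:
                dist[nbr] = dist[node] + 1
                queue.append(nbr)
    return dist

def average_cluster_mass(adj, seeds, ell):
    """Average number of nodes lying within distance ell of each seed node."""
    masses = []
    for seed in seeds:
        dist = bfs_distances(adj, seed)
        masses.append(sum(1 for d in dist.values() if d <= ell))
    return sum(masses) / len(masses)

# Hypothetical toy network: a path of 8 nodes, 0-1-...-7.
path = {i: [j for j in (i - 1, i + 1) if 0 <= j < 8] for i in range(8)}
print(average_cluster_mass(path, seeds=[0, 4], ell=2))
```

Repeating the measurement for several values of $\ell$ and plotting the result on log-log or semi-log axes is one way to tell the power law of Eq. (4) apart from exponential growth.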

The answer lies in the inherent inhomogeneity of the network. In the classical case of a homogeneous system (such
as a fractal percolation cluster) the degree distribution is very narrow and the two methods described above are
fully equivalent, because of this local neighborhood invariance. Indeed, all boxes in the box-covering method are
statistically similar to each other as well as to the clusters grown when using the cluster-growing technique, so that
Eq. (4) can be derived from Eq. (3) and $d_B = d_f$.

In inhomogeneous systems, though, the local environment can vary significantly. In this case, Eqs. (3) and (4)
are no longer equivalent. If we focus on the box-covering technique, then we want to cover the entire network with
the minimum possible number of boxes $N_B(\ell_B)$, where the distance between any two nodes that belong to a box is
smaller than $\ell_B$. An example is shown in Fig. 2a using a simple 8-node network. After we repeat this procedure for
different values of $\ell_B$ we can plot $N_B$ vs. $\ell_B$.
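
A minimal sketch of a box covering follows. It is a simple greedy heuristic, assumed here for illustration only (the Appendix is the place to look for the algorithms actually used): nodes are visited in order and each is placed in the first box whose members all lie at distance smaller than $\ell_B$ from it. A greedy pass gives an upper bound on the minimum number of boxes, not necessarily the minimum itself. The names `greedy_box_cover` and `count_boxes` are hypothetical.

```python
from collections import deque

def bfs_distances(adj, source):
    """Shortest-path distances (in links) from source to every reachable node."""
    dist = {source: 0}
    queue = deque([source])
    while queue:
        node = queue.popleft()
        for nbr in adj[node]:
            if nbr not in dist:
                dist[nbr] = dist[node] + 1
                queue.append(nbr)
    return dist

def greedy_box_cover(adj, ell_b):
    """Map node -> box id so that any two nodes sharing a box are at distance < ell_b."""
    dist = {v: bfs_distances(adj, v) for v in adj}
    box_of = {}
    for v in adj:
        # Boxes that already hold a node too far from v (or unreachable from it).
        forbidden = {box_of[w] for w in box_of
                     if dist[v].get(w, float("inf")) >= ell_b}
        box = 0
        while box in forbidden:
            box += 1
        box_of[v] = box
    return box_of

def count_boxes(adj, ell_b):
    """Number of boxes used by the greedy cover, an estimate of N_B(ell_b)."""
    return len(set(greedy_box_cover(adj, ell_b).values()))

# Hypothetical toy network: a path of 8 nodes, 0-1-...-7.
path = {i: [j for j in (i - 1, i + 1) if 0 <= j < 8] for i in range(8)}
for ell_b in (1, 3, 8):
    print(ell_b, count_boxes(path, ell_b))
```

Because the sketch computes all pairwise distances, it is only practical for small networks.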



When the box-covering method is applied to real large-scale networks, such as the WWW [25]
(http://www.nd.edu/~networks), the networks of protein interactions of H. sapiens and E. coli [30] and several
cellular networks [31, 32], they follow Eq. (3) with a clear power law, indicating the fractal nature of these
systems (Figs. 3a, 3b, 3c). On the other hand, when the method is applied to other real-world networks such as the
Internet [33] or to the Barabási-Albert network [34], they do not satisfy Eq. (3), which shows that these networks
are not fractal.

The reason behind the discrepancy in the fractality of homogeneous and inhomogeneous systems can be better
clarified by studying the mass of the boxes. For a given value of $\ell_B$, the average mass of a box
$\langle M_B(\ell_B) \rangle$ is

$$\langle M_B(\ell_B) \rangle = N / N_B(\ell_B) \sim \ell_B^{d_B}, \tag{5}$$

as also verified in Fig. 3 for several real-world networks. On the other hand, the average performed in the
cluster-growing method (averaging over single boxes without tiling the system) gives rise to an exponential growth of the
mass

$$\langle M_c(\ell) \rangle \sim e^{\ell/\ell_1}, \tag{6}$$

with $\ell_1$ a characteristic length, in accordance with the small-world effect, Eq. (2). Correspondingly, the
probability distribution of the mass of the boxes $M_B$ using box covering is very broad, while the cluster-growing
technique leads to a narrow probability distribution of $M_c$.

The topology of scale-free networks is dominated by several highly connected hubs (the nodes with the largest
degree), implying that most of the nodes are connected to the hubs via one or very few steps. Therefore, the average
performed in the cluster-growing method is biased: the hubs are overrepresented in Eq. (6), since almost every node
is a neighbor of a hub and there is always a very large probability of including the same hubs in all clusters. On
the other hand, the box-covering method is a global tiling of the system that provides a flat average over all the nodes,
i.e., each part of the network is covered with equal probability. Once a hub (or any node) is covered, it cannot be
covered again.

In conclusion, we can state that the two dominant methods that are routinely used for calculations of fractality
and give rise to Eqs. (3) and (4) are not equivalent in scale-free networks, but rather highlight different aspects: box
covering reveals the self-similarity, while cluster growing reveals the small-world effect. The apparent contradiction is
due to the hubs being used many times in the latter method.

The apparent contradiction between the small-world and fractal properties, as expressed through Eqs. (2) and (3), can
be explained as follows. Scale-free networks can be classified into three groups: (i) pure fractal, (ii) pure small-world
and (iii) a mixture between fractal and small-world. (i) A fractal network satisfies Eq. (3) at all scales, meaning that

FIG. 2: The renormalization procedure for complex networks. a,
Demonstration of the method for different $\ell_B$ and different stages in a network demo. The first column depicts the
original network. The system is tiled with boxes of size $\ell_B$ (different colors correspond to different boxes). All
nodes in a box are connected by a minimum distance smaller than the given $\ell_B$. For instance, in the case of
$\ell_B = 2$, one identifies four boxes which contain the nodes depicted with colors red, orange, white, and blue,
containing 3, 2, 1, and 2 nodes, respectively. Then each box is replaced by a single node; two renormalized nodes are
connected if there is at least one link between the unrenormalized boxes. Thus we obtain the network shown in the second
column. The resulting number of boxes needed to tile the network, $N_B(\ell_B)$, is plotted in Fig. 3 versus $\ell_B$ to
obtain $d_B$ as in Eq. (3). The renormalization procedure is applied again and repeated until the network is reduced to
a single node (third and fourth columns for different $\ell_B$). b, Three stages in the renormalization scheme applied
to the entire WWW. We fix the box size to $\ell_B = 3$ and apply the renormalization for four stages. This corresponds,
for instance, to the sequence for the network demo depicted in the second row in part a of this figure. We color the
nodes in the web according to the boxes to which they belong.

for any value of $\ell_B$, the number of boxes always follows a power law (examples are shown in Figs. 3a, 3b, 3c). (ii)
When a network is a pure small-world, it never satisfies Eq. (3). Instead, $N_B$ follows an exponential decay with
$\ell_B$ and the network cannot be regarded as fractal. Figs. 3d and 3e show two examples of pure small-world networks.
(iii) In the case of a mixture between fractal and small-world, Eq. (3) is satisfied up to some cut-off value of
$\ell_B$, above which the fractality breaks down and the small-world property emerges. The small-world property is
reflected in the plot of $N_B$ vs. $\ell_B$ as an exponential cut-off for large $\ell_B$.

We can also understand the coexistence of the small-world property and the fractality through a more intuitive
approach. In a pure fractal network the length of a path between any pair of nodes scales as a power law with the
number of nodes in the network. Therefore, the diameter $L$ also follows a power law, $L \sim N^{1/d_B}$. If one adds a

FIG. 3: Self-similar scaling in complex networks. a, Upper panel: Log-log plot of $N_B$ vs. $\ell_B$ revealing the
self-similarity of the WWW according to Eq. (3). Lower panel: The scaling of $s(\ell_B)$ vs. $\ell_B$ according to
Eq. (9). b, Same as (a) but for two protein interaction networks: H. sapiens and E. coli. Results are analogous to (a)
but with different scaling exponents. c, Same as (a) for the cellular networks of A. fulgidus, E. coli and C. elegans.
d, Internet. Log-log plot of $N_B(\ell_B)$. The solid line shows that the Internet [33] is not a fractal network since
it does not follow the power-law relation of Eq. (3). e, Same as (d) for the Barabási-Albert model network [34] with
m = 3 and m = 5.

FIG. 4: Invariance of the degree distribution of the WWW under the renormalization for different box sizes, $\ell_B$.
We show the data collapse of the degree distributions, demonstrating the self-similarity at different scales. The inset
shows the scaling of $k' = s(\ell_B)\,k$ for different $\ell_B$, from which we obtain the scaling factor $s(\ell_B)$.
Moreover, renormalization for a fixed box size ($\ell_B = 3$) is applied until the network is reduced to a few nodes.
It was found that $P(k)$ is invariant under these multiple renormalization procedures.



few shortcuts (links between randomly chosen nodes), many paths in the network are drastically shortened and the
small-world property emerges as $L \sim \log N$. In spite of this fact, for shorter scales, $\ell_B \ll L$, the network
still behaves as a fractal. In this sense, we can say that globally the network is small-world, but locally (for short
scales) the network behaves as a fractal. As more shortcuts are added, the cut-off in a plot of $N_B$ vs. $\ell_B$
appears for smaller $\ell_B$, until the network becomes a pure small-world for which all path lengths increase
logarithmically with $N$.

The reasons why certain networks have evolved towards a fractal or non-fractal structure will be described later, 
together with models and examples that provide additional insight into the processes involved. 



Renormalization 

Renormalization is one of the most important techniques in modern statistical physics. The idea behind
this procedure is to continuously create smaller replicas of a given object, retaining at the same time the essential
structural features, in the hope that the coarse-grained copies will be more amenable to analytic treatment.

The idea for renormalizing the network emerges naturally from the concept of fractality described above. If a 
network is self-similar, then it will look more or less the same under different scales. The way to observe these 
different length-scales is based on renormalization principles, while the criterion to decide on whether a renormalized 
structure retains its form is the invariance of the main structural features, expressed mainly through the degree 
distribution. 

The method works as follows. Start by fixing the value of $\ell_B$ and applying the box-covering algorithm in order to
cover the entire network with boxes (see Appendix). In the renormalized network each box is replaced by a single
node, and two nodes are connected if there was at least one connection between the two corresponding boxes in
the original network. The resulting structure represents the first stage of the renormalized network. We can apply
the same procedure to this new network as well, resulting in the second renormalization stage, and so on
until we are left with a single node.
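
One renormalization step, as just described, can be sketched as follows. The sketch assumes that a box assignment is already available (for instance from a box-covering routine); the function name `renormalize` and the dictionary-based representation are choices made for this example only.

```python
def renormalize(adj, box_of):
    """Collapse each box into one node; link two boxes if any original link joins them."""
    new_adj = {box: set() for box in set(box_of.values())}
    for node, nbrs in adj.items():
        for nbr in nbrs:
            a, b = box_of[node], box_of[nbr]
            if a != b:          # links inside a box disappear
                new_adj[a].add(b)
                new_adj[b].add(a)
    return new_adj

# Hypothetical toy example: a 4-node path tiled with two boxes, "A" and "B".
path = {0: [1], 1: [0, 2], 2: [1, 3], 3: [2]}
boxes = {0: "A", 1: "A", 2: "B", 3: "B"}
stage1 = renormalize(path, boxes)
print(stage1)
```

Applying a box covering to `stage1` and calling `renormalize` again gives the second stage, and so on until a single node remains.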

The second column of the panels in Fig. 2a shows this step in the renormalization procedure for the schematic
network, while Fig. 2b shows the results of the same procedure applied to the entire WWW for $\ell_B = 3$.

The renormalized network gives rise to a new probability distribution of links, $P(k')$ (we use a prime ' to denote
quantities in the renormalized network). This distribution remains invariant under the renormalization:

$$P(k) \to P(k') \sim (k')^{-\gamma}. \tag{7}$$

Fig. 4 supports the validity of this scale transformation by showing a data collapse of all distributions with the same
$\gamma$ according to Eq. (7) for the WWW.

Here, we present the basic scaling relations that characterize renormalizable networks. The degree $k'$ of each node
in the renormalized network can be seen to scale with the largest degree $k$ in the corresponding original box as

$$k \to k' = s(\ell_B)\,k. \tag{8}$$

This equation defines the scaling transformation in the connectivity distribution. Empirically, it was found that the
scaling factor $s$ ($< 1$) scales with $\ell_B$ with a new exponent, $d_k$, as $s(\ell_B) \sim \ell_B^{-d_k}$, so that

$$k' \sim \ell_B^{-d_k}\,k. \tag{9}$$

This scaling is verified for many networks, as shown in Fig. 4.

The exponents $\gamma$, $d_B$, and $d_k$ are not all independent of each other. The proof starts from the density balance
equation $n(k)\,dk = n'(k')\,dk'$, where $n(k) = N P(k)$ is the number of nodes with degree $k$ and $n'(k') = N' P(k')$ is
the number of nodes with degree $k'$ after the renormalization ($N'$ is the total number of nodes in the renormalized
network). Substituting Eq. (8) leads to $N' = s^{\gamma - 1} N$. Since the total number of nodes in the renormalized
network is the number of boxes needed to cover the unrenormalized network at any given $\ell_B$, we have the identity
$N' = N_B(\ell_B)$. Finally, from Eqs. (3) and (9) one obtains the relation between the three indexes

$$\gamma = 1 + d_B/d_k. \tag{10}$$

The use of Eq. (10) yields the same $\gamma$ exponent as that obtained in the direct calculation of the degree
distribution. The significance of this result is that the scale-free properties characterized by $\gamma$ can be related
to a more fundamental length-scale invariant property, characterized by the two new indexes $d_B$ and $d_k$.
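
For readers who want the intermediate algebra, the steps leading to Eq. (10) can be written out as below. This is a worked restatement of the argument in the text, using only Eqs. (1), (3), (8) and (9); the proportionality $N_B \sim N\,\ell_B^{-d_B}$ at fixed $N$ is the assumed reading of Eq. (3).

```latex
\begin{align*}
  N\,P(k)\,dk &= N'\,P(k')\,dk', \qquad P(k) \sim k^{-\gamma}, \quad k' = s\,k, \quad dk' = s\,dk \\
  N\,k^{-\gamma}\,dk &= N'\,(s k)^{-\gamma}\,s\,dk
      \;\;\Longrightarrow\;\; N' = s^{\gamma-1}\,N \\
  N' = N_B(\ell_B) \sim N\,\ell_B^{-d_B}, \quad s \sim \ell_B^{-d_k}
      &\;\;\Longrightarrow\;\; \ell_B^{-d_B} \sim \ell_B^{-d_k(\gamma-1)} \\
  d_B = d_k\,(\gamma - 1) &\;\;\Longrightarrow\;\; \gamma = 1 + \frac{d_B}{d_k}
\end{align*}
```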

We have thus seen that concepts introduced originally for the study of critical phenomena in statistical physics
are also valid in the characterization of a different class of phenomena: the topology of complex networks. A large
number of scale-free networks are fractal, and an even larger number remain invariant under a scale transformation.
The influence of these features on the network properties is deferred to the section "Properties of Fractal and
Transfractal Networks", after we introduce some algorithms for efficient numerical calculations and two theoretical
models that give rise to fractal networks.



MODELS: DETERMINISTIC FRACTAL AND TRANSFRACTAL NETWORKS 

The first model of a scale-free fractal network was presented in 1979, when N. Berker and S. Ostlund [38] proposed
a hierarchical network that served as an exotic example where renormalization group techniques yield exact results,
including the percolation phase transition and the $q \to 1$ limit of the Potts model. Unfortunately, in those days the
importance of the power-law degree distribution and the concept of fractal and non-fractal complex networks were
not known. Much work has been done on these types of hierarchical networks. For example, in 1984, M. Kaufman
and R. Griffiths made use of Berker and Ostlund's model to study the percolation phase transition and its percolation
exponents.



Since the late 1990s, when the importance of the power-law degree distribution was first shown [1], and after the
finding of C. Song, S. Havlin and H. Makse [17], many hierarchical networks that describe fractality in complex
networks have been proposed. These artificial models are of great importance since they provide insight into the
origins and fundamental properties that give rise to the fractality and non-fractality of networks.



The Song-Havlin-Makse Model 



The correlations between degrees in a network [42, 43, 44, 45] are quantified through the probability $P(k_1, k_2)$ that
a node of degree $k_1$ is connected to another node of degree $k_2$. In Fig. 5 we can see the degree correlation profile
$R(k_1, k_2) = P(k_1, k_2)/P_r(k_1, k_2)$ of the cellular metabolic network of E. coli (known to be a fractal network) and
of the Internet at the router level [13] (a non-fractal network), where $P_r(k_1, k_2)$ is obtained by randomly swapping
the links without modifying the degree distribution.

Fig. 5 shows a dramatic difference between the two networks. The network of E. coli, which is a fractal network,
presents an anti-correlation of the degrees (or disassortativity [42, 43]), which means that high-degree nodes
are mostly linked to low-degree nodes. This property leads to fractal networks. On the other hand, the Internet exhibits
a high correlation between degrees, leading to a non-fractal network.

With this idea in mind, in 2006 C. Song, S. Havlin and H. Makse presented a model that elucidates the way new 
nodes must be connected to the old ones in order to build a fractal, a non-fractal network, or a mixture between 



FIG. 5: Degree correlation profile for (a) the cellular metabolic network of E. coli, and (b) the Internet at the router level. 


FIG. 6: The model grows from a small network, usually two nodes connected to each other. During each step and for every 
link in the system, each endpoint of a link produces m offspring nodes (in this drawing m = 3). In this case, with probability 
e = 1 the original link is removed and x new links between randomly selected nodes of the new generation are added. Notice 
that the case of x = 1 results in a tree structure, while loops appear for x > 1. 




fractal and non-fractal network [18]. This model shows that, indeed, the correlations between degrees of the nodes
are a determinant factor for the fractality of a network. This model was later extended [48] to allow loops in the
network, while preserving the self-similarity and fractality properties.

The algorithm is as follows (see Fig. 6): In generation $n = 0$, start with two nodes connected by one link. Then,
generation $n + 1$ is obtained recursively by attaching $m$ new nodes to each endpoint of each link $l$ of generation $n$.
In addition, with probability $e$ remove link $l$ and add $x$ new links connecting pairs of new nodes attached to the
endpoints of $l$.
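
The growth rule above can be sketched as follows. This is an illustrative reading of the rule, not the authors' code: the function name `shm_network` is hypothetical, each of the $x$ new links is assumed to join an offspring of one endpoint with an offspring of the other (chosen at random), and the sketch therefore requires $x \le m$.

```python
import random

def shm_network(generations, m, x, e, seed=0):
    """Grow the network for a number of generations; returns (number_of_nodes, edge_list)."""
    if x > m:
        raise ValueError("this sketch assumes x <= m")
    rng = random.Random(seed)
    edges = [(0, 1)]            # generation 0: two nodes joined by one link
    n_nodes = 2
    for _ in range(generations):
        new_edges = []
        for a, b in edges:
            # m offspring nodes attached to each endpoint of the link (a, b)
            kids_a = list(range(n_nodes, n_nodes + m))
            kids_b = list(range(n_nodes + m, n_nodes + 2 * m))
            n_nodes += 2 * m
            new_edges += [(a, k) for k in kids_a] + [(b, k) for k in kids_b]
            if rng.random() < e:
                # remove the old link and add x links between the two offspring groups
                new_edges += list(zip(rng.sample(kids_a, x), rng.sample(kids_b, x)))
            else:
                new_edges.append((a, b))    # keep the old link
        edges = new_edges
    return n_nodes, edges

n_nodes, edges = shm_network(generations=2, m=2, x=1, e=1.0)
print(n_nodes, len(edges))
```

With $e = 1$ every link is replaced, so the number of links is multiplied by $2m + x$ in each generation and every path between old nodes becomes three times longer.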

The degree distribution, diameter and fractal dimension can be easily calculated. For example, if $e = 1$ (pure fractal
network), the degree distribution follows a power law $P(k) \sim k^{-\gamma}$ with exponent
$\gamma = 1 + \log(2m + x)/\log m$, and the fractal dimension is $d_B = \log(2m + x)/\log 3$, since each generation
multiplies the number of nodes by $2m + x$ while tripling the distances. The diameter $L$ scales, in this case, as a power
of the number of nodes, $L \sim N^{1/d_B}$ [18, 24]. Later, in Section "Properties of Fractal and Transfractal Networks",
several topological properties are shown for this model network.



(u, v)-flowers

In 2006, H. Rozenfeld, S. Havlin and D. ben-Avraham proposed a new family of recursive deterministic scale-free
networks, the $(u,v)$-flowers, that generalize both the original scale-free model of Berker and Ostlund [38] and the
pseudo-fractal network of Dorogovtsev, Goltsev and Mendes (DGM) [49] and that, by appropriately varying the two
parameters $u$ and $v$, lead to either fractal or non-fractal networks [50, 51]. The algorithm to build the
$(u,v)$-flowers is the following: In generation $n = 1$ one starts with a cycle graph (a ring) consisting of
$u + v \equiv w$ links and nodes (other choices are possible). Then, generation $n + 1$ is obtained recursively by
replacing each link by two parallel paths of $u$ and $v$ links long. Without loss of generality, $u \le v$. Examples of
$(1,3)$- and $(2,2)$-flowers are shown in Fig. 7. The DGM network corresponds to the special case of $u = 1$ and $v = 2$,
and the Berker and Ostlund model corresponds to $u = 2$ and $v = 2$.

An essential property of the $(u,v)$-flowers is that they are self-similar, as is evident from an equivalent method of
construction: to produce generation $n + 1$, make $w = u + v$ copies of the net in generation $n$ and join them at the
hubs.





FIG. 7: $(u,v)$-flowers with $u + v = 4$ ($\gamma = 3$). (a) $u = 1$ (dotted line) and $v = 3$ (broken line), (b) $u = 2$
and $v = 2$. The graphs may also be iterated by joining four replicas of generation $n$ at the hubs A and B, for (a), or
A and C, for (b).



The number of links of a $(u,v)$-flower of generation $n$ is

$$M_n = (u + v)^n = w^n, \tag{11}$$

and the number of nodes is

$$N_n = \left(\frac{w-2}{w-1}\right) w^n + \left(\frac{w}{w-1}\right). \tag{12}$$

The degree distribution of the $(u,v)$-flowers can also be easily obtained since, by construction, $(u,v)$-flowers have
only nodes of degree $k = 2^m$, $m = 1, 2, \ldots, n$. As in the DGM case, $(u,v)$-flowers follow a scale-free degree
distribution, $P(k) \sim k^{-\gamma}$, of degree exponent

$$\gamma = 1 + \frac{\ln(u+v)}{\ln 2}. \tag{13}$$
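
The construction rule and the counts in Eqs. (11) and (12) can be checked with a short sketch. The function names `uv_flower` and `degree_counts` and the edge-list representation are assumptions made for this example; generation 1 is taken to be the ring of $w = u + v$ links, as in the text.

```python
from collections import Counter

def uv_flower(u, v, n):
    """Return (number_of_nodes, edge_list) of the generation-n (u, v)-flower."""
    w = u + v
    edges = [(i, (i + 1) % w) for i in range(w)]    # generation 1: ring of w links
    n_nodes = w
    for _ in range(n - 1):
        new_edges = []
        for a, b in edges:
            for length in (u, v):                    # two parallel paths per old link
                chain = [a] + list(range(n_nodes, n_nodes + length - 1)) + [b]
                n_nodes += length - 1
                new_edges += list(zip(chain, chain[1:]))
        edges = new_edges
    return n_nodes, edges

def degree_counts(edges):
    """Map degree -> number of nodes having that degree."""
    deg = Counter()
    for a, b in edges:
        deg[a] += 1
        deg[b] += 1
    return Counter(deg.values())

for u, v, n in [(1, 3, 3), (2, 2, 3), (1, 2, 4)]:
    w = u + v
    n_nodes, edges = uv_flower(u, v, n)
    # Compare with Eq. (11) and Eq. (12).
    print((u, v, n), len(edges) == w**n, n_nodes == ((w - 2) * w**n + w) // (w - 1))
```

Calling `degree_counts` on the edge list shows which degrees occur and how many nodes carry each of them.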

Recursive scale-free trees may be defined in analogy to the flower nets. If $v$ is even, one obtains generation $n + 1$
of a $(u,v)$-tree by replacing every link in generation $n$ with a chain of $u$ links, and attaching to each of its
endpoints chains of $v/2$ links. Fig. 8 shows how this works for the $(1,2)$-tree. If $v$ is odd, attach to the endpoints
(of the chain of $u$ links) chains of length $(v \pm 1)/2$. The trees may also be constructed by successively joining $w$
replicas at the appropriate hubs, and they too are self-similar. They share many of the fundamental scaling properties
with $(u,v)$-flowers: their degree distribution is also scale-free, with the same degree exponent as $(u,v)$-flowers.

The self-similarity of $(u,v)$-flowers, coupled with the fact that different replicas meet at a single node, makes them
amenable to exact analysis by renormalization techniques. The lack of loops, in the case of $(u,v)$-trees, further
simplifies their analysis [38, 51, 52].








FIG. 8: The $(1,2)$-tree. (a) Each link in generation $n$ is replaced by a chain of $u = 1$ links, to whose ends one
attaches chains of $v/2 = 1$ links. (b) Alternative method of construction highlighting self-similarity: $u + v = 3$
replicas of generation $n$ are joined at the hubs. (c) Generations $n = 1, 2, 3$.



Dimensionality of the (u, v)-flowers



There is a vast difference between $(u, v)$-nets with $u = 1$ and $u > 1$. If $u = 1$ the diameter $L_n$ of the $n$-th generation
flower scales linearly with $n$. For example, $L_n = n$ for the (1, 2)-flower and $L_n = 2n$ for the (1, 3)-flower. It is easy to
see that the diameter of the $(1, v)$-flower, for $v$ odd, is $L_n = (v - 1)n + (3 - v)/2$, and, in general, one can show that
$L_n \sim (v - 1)n$.

For $u > 1$, however, the diameter grows exponentially with $n$. For example, for the (2, 2)-flower we find $L_n = 2^n$, and,
more generally, the diameter satisfies $L_n \sim u^n$. To summarize,

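$$L_n \sim \begin{cases} (v - 1)n & u = 1, \\ u^n & u > 1, \end{cases} \qquad \text{flowers.} \qquad (14)$$

Similar results are quite obvious for the case of $(u, v)$-trees, where

$$L_n \sim \begin{cases} vn & u = 1, \\ u^n & u > 1, \end{cases} \qquad \text{trees.} \qquad (15)$$

Since $N_n \sim (u + v)^n$ [Eq. (12)], we can recast these relations as

$$L \sim \begin{cases} \ln N & u = 1, \\ N^{\ln u/\ln(u + v)} & u > 1. \end{cases} \qquad (16)$$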

Thus, $(u, v)$-nets are small world only in the case of $u = 1$. For $u > 1$, the diameter increases as a power of $N$, just
as in finite-dimensional objects, and the nets are in fact fractal. For $u > 1$, the change of mass upon the rescaling of
length by a factor $b$ is

$$N(bL) = b^{d_B} N(L) , \qquad (17)$$

where $d_B$ is the fractal dimension. In this case, $N(uL) = (u + v)N(L)$, so

$$d_B = \frac{\ln(u + v)}{\ln u} , \qquad u > 1 . \qquad (18)$$
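
The counting formulas and the diameter scaling above can be checked numerically on small generations. The sketch below is only illustrative: it assumes the standard flower construction (generation 1 is a cycle of $w = u + v$ links, and each link is then replaced by two parallel paths of $u$ and $v$ links) and $w \geq 3$. The function names `uv_flower` and `diameter` are hypothetical helpers, not part of the original work.

```python
from collections import deque


def uv_flower(u, v, n):
    """Adjacency lists and link count of the generation-n (u, v)-flower.

    Assumed construction: generation 1 is a cycle of w = u + v links (w >= 3);
    each later generation replaces every link by two parallel paths of
    u and v links.
    """
    w = u + v
    edges = [(i, (i + 1) % w) for i in range(w)]
    next_id = w
    for _generation in range(n - 1):
        new_edges = []
        for a, b in edges:
            for length in (u, v):
                prev = a
                for _step in range(length - 1):
                    new_edges.append((prev, next_id))  # new internal node
                    prev = next_id
                    next_id += 1
                new_edges.append((prev, b))
        edges = new_edges
    adj = {i: [] for i in range(next_id)}
    for a, b in edges:
        adj[a].append(b)
        adj[b].append(a)
    return adj, len(edges)


def diameter(adj):
    """Largest shortest-path distance, by breadth-first search from every node."""
    best = 0
    for source in adj:
        dist = {source: 0}
        queue = deque([source])
        while queue:
            node = queue.popleft()
            for nb in adj[node]:
                if nb not in dist:
                    dist[nb] = dist[node] + 1
                    queue.append(nb)
        best = max(best, max(dist.values()))
    return best


for u, v in [(1, 3), (2, 2)]:
    w = u + v
    for n in (1, 2, 3):
        adj, links = uv_flower(u, v, n)
        nodes_eq12 = ((w - 2) * w**n + w) // (w - 1)  # Eq. (12)
        print(u, v, n, links == w**n, len(adj) == nodes_eq12, diameter(adj))
```

Comparing the printed diameters of the (1, 3)- and (2, 2)-flowers, which have the same number of nodes and links at every generation, gives a direct view of the difference between the small-world case $u = 1$ and the fractal case $u > 1$.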
Transfinite Fractals



Small world nets, such as $(1, v)$-nets, are infinite-dimensional. Indeed, their mass ($N$, or $M$) increases faster than
any power (dimension) of their diameter. Also, note that a naive application of (18) to $u \to 1$ yields $d_B \to \infty$. In the
case of $(1, v)$-nets one can use their weak self-similarity to define a new measure of dimensionality, $\tilde{d}_f$, characterizing
how mass scales with diameter:

$$N(L + \ell) = e^{\ell \tilde{d}_f} N(L) . \qquad (19)$$

Instead of a multiplicative rescaling of length, $L \mapsto bL$, a slower additive mapping, $L \mapsto L + \ell$, that reflects the
small world property is considered. Because the exponent $\tilde{d}_f$ usefully distinguishes between different graphs of infinite
dimensionality, $\tilde{d}_f$ has been termed the transfinite fractal dimension of the network. Accordingly, objects that are
self-similar and have infinite dimension (but finite transfinite dimension), such as the $(1, v)$-nets, are termed transfinite
fractals, or transfractals, for short.

For $(1, v)$-nets, we see that upon 'zooming in' one generation level the mass increases by a factor of $w = 1 + v$, while
the diameter grows from $L$ to $L + v - 1$ (for flowers), or to $L + v$ (trees). Hence their transfractal dimension is

$$\tilde{d}_f = \begin{cases} \dfrac{\ln(1 + v)}{v} & (1, v)\text{-trees}, \\[2mm] \dfrac{\ln(1 + v)}{v - 1} & (1, v)\text{-flowers}. \end{cases} \qquad (20)$$

There is some arbitrariness in the selection of $e$ as the base of the exponential in the definition (19). However, the
base is inconsequential for the sake of comparison between dimensionalities of different objects. Also, scaling relations
between various transfinite exponents hold, irrespective of the choice of base: consider the scaling relation of Eq. (10),
$\gamma = 1 + d_B/d_k$, valid for fractal scale-free nets of degree exponent $\gamma$ [17, 18]. For example, in the fractal $(u, v)$-nets
(with $u > 1$) renormalization reduces lengths by a factor $b = u$ and all degrees are reduced by a factor of 2, so $b^{d_k} = 2$. Thus
$d_k = \ln 2/\ln u$, and since $d_B = \ln(u + v)/\ln u$ and $\gamma = 1 + \ln(u + v)/\ln 2$, as discussed above, the relation (10) is
indeed satisfied.

For transfractals, renormalization reduces distances by an additive length, $\ell$, and we express the self-similarity
manifest in the degree distribution as

$$P'(k) = e^{\ell \tilde{d}_k} P(e^{\ell \tilde{d}_k} k) , \qquad (21)$$

where $\tilde{d}_k$ is the transfinite exponent analogous to $d_k$. Renormalization of the transfractal $(1, v)$-nets reduces the link
lengths by $\ell = v - 1$ (for flowers), or $\ell = v$ (trees), while all degrees are halved. Thus,

$$\tilde{d}_k = \begin{cases} \dfrac{\ln 2}{v} & (1, v)\text{-trees}, \\[2mm] \dfrac{\ln 2}{v - 1} & (1, v)\text{-flowers}. \end{cases}$$

Along with (20), this result confirms that the scaling relation

$$\gamma = 1 + \frac{\tilde{d}_f}{\tilde{d}_k} \qquad (22)$$

is valid also for transfractals, and regardless of the choice of base. A general proof of this relation is practically
identical to the proof of its fractal counterpart [17], merely replacing fractal with transfractal scaling throughout the argument.

For scale-free transfractals, following $m = L/\ell$ renormalizations the diameter and mass reduce to order one, and
the scaling (19) implies $L \sim m\ell$, $N \sim e^{m\ell \tilde{d}_f}$, so that

$$L \sim \frac{1}{\tilde{d}_f} \ln N ,$$

in accordance with their small world property. At the same time the scaling (21) implies $K \sim e^{m\ell \tilde{d}_k}$, or $K \sim N^{\tilde{d}_k/\tilde{d}_f}$.
Using the scaling relation (22), we rederive $K \sim N^{1/(\gamma - 1)}$, which is indeed valid for scale-free nets in general, be they
fractal or transfractal.
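
As a small consistency check of the scaling relation between the degree exponent and the (trans)fractal exponents, the following sketch evaluates the closed-form expressions quoted above for a few flowers. The function name `flower_exponents` is a hypothetical helper, and the transfractal branch assumes $u = 1$ with $v \geq 2$ so that the additive length $v - 1$ is nonzero.

```python
import math


def flower_exponents(u, v):
    """Return (gamma, d_mass, d_degree) for (u, v)-flowers.

    For u > 1 the last two values are the fractal exponents d_B and d_k.
    For u = 1 (assuming v >= 2) they are the transfinite exponents, defined
    with the additive length v - 1.
    """
    gamma = 1 + math.log(u + v) / math.log(2)      # Eq. (13)
    if u > 1:
        d_mass = math.log(u + v) / math.log(u)     # Eq. (18)
        d_degree = math.log(2) / math.log(u)       # from b**d_k = 2 with b = u
    else:
        ell = v - 1                                # additive length for flowers
        d_mass = math.log(1 + v) / ell             # Eq. (20), flowers
        d_degree = math.log(2) / ell               # all degrees are halved
    return gamma, d_mass, d_degree


for u, v in [(2, 2), (3, 5), (1, 2), (1, 3)]:
    gamma, d_mass, d_degree = flower_exponents(u, v)
    # The same relation holds in the fractal and the transfractal case.
    assert math.isclose(gamma, 1 + d_mass / d_degree)
    print(u, v, round(gamma, 4), round(d_mass, 4), round(d_degree, 4))
```

The additive length cancels in the ratio of the two transfinite exponents, which is why the choice of base in the exponential does not affect the relation.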
PROPERTIES OF FRACTAL AND TRANSFRACTAL NETWORKS

The existence of fractality in complex networks immediately raises the question of how such a structure affects
network properties. In general, most of the relevant applications seem to be modified
to a larger or lesser extent, so that fractal networks can be considered to form a separate network sub-class, sharing
the main properties that result from the wide degree distribution of regular scale-free networks, but at the same time bearing
novel properties. Moreover, from a practical point of view, a fractal network is usually more amenable to analytic
treatment.

In this section we summarize some of the applications that seem to distinguish fractal from non-fractal networks.

Modularity 

Modularity is a property closely related to fractality. Although this term does not have a unique, well-defined
definition, modularity refers to the existence of areas in the network where groups of nodes share
some common characteristics, such as preferentially connecting within this area (the 'module') rather than to the rest
of the network. The isolation of modules into distinct areas is a complicated task and in most cases there are many
possible ways (and algorithms) to partition a network into modules.

Although networks with a significant degree of modularity are not necessarily fractals, practically all fractal networks
are highly modular in structure. Modularity naturally emerges from the effective 'repulsion' between hubs. Since the
hubs are not directly connected to each other, they usually dominate their neighborhood and can be considered as
the 'center of mass' for a given module. The nodes surrounding hubs are usually assigned to this module.

The renormalization property of self-similar networks is very useful for estimating how modular a given network is,
and especially for how this property is modified under varying scales of observation. We can use a simple definition
for modularity $M$, based on the idea that the number of links connecting nodes within a module, $L_i^{\rm in}$, is higher than
the number of links connecting nodes in different modules, $L_i^{\rm out}$. For this purpose, the boxes that result from the
box-covering method at a given length-scale $\ell_B$ are identified as the network modules for this scale. This partitioning
assumes that the minimization of the number of boxes corresponds to an increase of modularity, taking advantage of
the idea that all nodes within a box can reach each other within less than $\ell_B$ steps. This constraint tends to assign the
largest possible number of nodes in a given neighborhood within the same box, resulting in an optimized modularity
function.

A definition of the modularity function $M$ that takes advantage of the special features of the renormalization process
is, thus, the following [48]:

$$M(\ell_B) = \frac{1}{N_B} \sum_{i=1}^{N_B} \frac{L_i^{\rm in}}{L_i^{\rm out}} , \qquad (23)$$

where the sum is over all the boxes.
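
A minimal sketch of how the modularity of Eq. (23) could be evaluated for a given box covering is shown below. The function name `box_modularity` and the input format (an edge list plus a node-to-box dictionary) are assumptions made for illustration. Boxes with no outgoing links are skipped in the average, which is also an assumption, since the ratio is undefined for them.

```python
def box_modularity(edges, box_of):
    """Average ratio of internal to outgoing links over all boxes, Eq. (23).

    edges  : iterable of (node_a, node_b) pairs, one entry per link
    box_of : dict mapping every node to the label of its box
    Assumption: boxes with zero outgoing links are left out of the average.
    """
    boxes = set(box_of.values())
    links_in = {b: 0 for b in boxes}
    links_out = {b: 0 for b in boxes}
    for a, b in edges:
        if box_of[a] == box_of[b]:
            links_in[box_of[a]] += 1
        else:
            # A link between two boxes counts as outgoing for both of them.
            links_out[box_of[a]] += 1
            links_out[box_of[b]] += 1
    ratios = [links_in[b] / links_out[b] for b in boxes if links_out[b] > 0]
    if not ratios:
        raise ValueError("no box has outgoing links; the ratio is undefined")
    return sum(ratios) / len(ratios)


# Two triangles joined by a single link, each triangle taken as one box.
edges = [(0, 1), (1, 2), (0, 2), (3, 4), (4, 5), (3, 5), (2, 3)]
box_of = {0: "A", 1: "A", 2: "A", 3: "B", 4: "B", 5: "B"}
print(box_modularity(edges, box_of))
```

In this toy example each box has three internal links and one outgoing link, so the ratio is the same for both boxes.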

The value of $M$ through Eq. (23) for a given $\ell_B$ value is of limited usefulness on its own, though. We can gather
more information on the network structure if we measure $M$ for different values of $\ell_B$. If the dependence of $M$ on $\ell_B$
has the form of a power law, as is often the case in practice, then we can define the modularity exponent $d_M$ through

$$M(\ell_B) \sim \ell_B^{d_M} . \qquad (24)$$

The exponent $d_M$ carries the important information of how modularity scales with the length, and separates modular
from non-modular networks. The value of $d_M$ is easy to compute in a $d$-dimensional lattice, since the number of links
within any module scales with its bulk, as $L_i^{\rm in} \sim \ell_B^{d}$, and the number of links outside the module scales with the length
of its interface, i.e. $L_i^{\rm out} \sim \ell_B^{d-1}$. So, the resulting scaling is $M \sim \ell_B$, i.e. $d_M = 1$. This is also the borderline value
that separates non-modular structures ($d_M < 1$) from modular ones ($d_M > 1$).

For the Song-Havlin-Makse fractal model introduced in the previous section, a module can be identified as the
neighborhood around a central hub. In the simplest version with $x = 1$, the network is a tree, with well-defined
modules. Larger values of $x$ mean that a larger number of links are connecting different modules, creating more loops
and 'blurring' the discreteness of the modules, so that we can vary the degree of modularity in the network. For this
model, it is also possible to analytically calculate the value of the exponent $d_M$.

During the growth process at step $t$, the diameter in the network model increases multiplicatively as $L(t + 1) = 3L(t)$.
The number of links within a module grows with $2m + x$ (each node on the side of one link gives rise to $m$ new links and
$x$ extra links connect the new nodes), while the number of links pointing out of a module is by definition proportional
to $x$. Thus, the modularity $M(\ell_B)$ of a network is proportional to $(2m + x)/x$. Eq. (24) can then be used to calculate
$d_M$ for the model:

$$\frac{2m + x}{x} \sim 3^{d_M} , \qquad (25)$$

which finally yields

$$d_M = \frac{\ln\left(2\frac{m}{x} + 1\right)}{\ln 3} . \qquad (26)$$

So, in this model the important quantity that determines the degree of modularity in the system is the ratio of the
growth parameters $m/x$.
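
The dependence of the modularity exponent on the ratio $m/x$ can be tabulated directly from Eq. (26). The short sketch below does so for a few hypothetical parameter choices and checks each value against Eq. (25); the function name `modularity_exponent` is illustrative.

```python
import math


def modularity_exponent(m, x):
    """Modularity exponent d_M of the growth model, Eq. (26)."""
    return math.log(2 * m / x + 1) / math.log(3)


# Hypothetical parameter choices, only to illustrate the trend with m/x.
for m, x in [(1, 1), (2, 1), (4, 1), (1, 2)]:
    d_M = modularity_exponent(m, x)
    # Consistency with Eq. (25): (2m + x)/x equals 3**d_M.
    assert math.isclose((2 * m + x) / x, 3**d_M)
    print(m, x, round(d_M, 4))
```

Within this formula, $m = x$ gives exactly the borderline value $d_M = 1$, while $m > x$ gives $d_M > 1$ and $m < x$ gives $d_M < 1$.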

Most of the real-life networks that have been measured display some sort of modular character, i.e. $d_M > 1$,
although many of them have values very close to 1. Only in a few cases have we observed exponents $d_M < 1$. Most
interesting, though, is the case of $d_M$ values much larger than 1, where a large degree of modularity is observed and
this trend is more pronounced for larger length-scales.

The importance of modularity as described above can be demonstrated in biological networks. There, it has been
suggested that the boxes may correspond to functional modules and in protein interaction networks, for example,
there may be an evolutionary drive of the system behind the development of its modular structure.



Robustness 



Shortly after the discovery of the scale-free property, the first important application of their structure was perhaps 
their extreme resilience to removal of random nodes [3, 0, [EI , EH 3, EEl Ell • At the same time such a network was 
found to be quite vulnerable to an intentional attack, where nodes are removed according to decreasing order of their
degree [H, [H?]]- The resilience of a network is usually quantified through the size of the largest remaining connected 
cluster $S_{\max}(p)$, when a fraction $p$ of the nodes has been removed according to a given strategy. At a critical point $p_c$
where this size becomes equal to $S_{\max}(p_c) \simeq 0$, we consider that the network has been completely disintegrated. For
the random removal case, this threshold is $p_c \simeq 1$, i.e. practically all nodes need to be destroyed. In striking contrast,
for intentional attacks $p_c$ is in general of the order of only a few percent, although the exact value depends on the
system details.
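
The measurement described above can be sketched in a few lines. The code below is a minimal illustration, not the procedure used in the cited studies: it removes nodes one at a time in a given order and records the relative size of the largest remaining cluster. The helper names are hypothetical, and the targeted order is computed once from the initial degrees, which is an assumption (degrees could also be recomputed after every removal).

```python
from collections import deque


def largest_cluster(adj, removed):
    """Size of the largest connected cluster once `removed` nodes are deleted."""
    seen = set(removed)
    best = 0
    for start in adj:
        if start in seen:
            continue
        seen.add(start)
        size = 0
        queue = deque([start])
        while queue:
            node = queue.popleft()
            size += 1
            for nb in adj[node]:
                if nb not in seen:
                    seen.add(nb)
                    queue.append(nb)
        best = max(best, size)
    return best


def attack_curve(adj, order):
    """List of (removed fraction p, relative largest-cluster size) pairs."""
    n = len(adj)
    removed = set()
    curve = [(0.0, largest_cluster(adj, removed) / n)]
    for count, node in enumerate(order, start=1):
        removed.add(node)
        curve.append((count / n, largest_cluster(adj, removed) / n))
    return curve


def targeted_order(adj):
    """Nodes sorted by decreasing initial degree (intentional attack)."""
    return sorted(adj, key=lambda node: len(adj[node]), reverse=True)


# Toy example: a star with one hub and five leaves.
star = {0: [1, 2, 3, 4, 5], 1: [0], 2: [0], 3: [0], 4: [0], 5: [0]}
print(attack_curve(star, targeted_order(star))[:2])
```

For a random-failure curve, the same `attack_curve` function can be called with a shuffled node list instead of `targeted_order(adj)`.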

Fractality in networks considerably strengthens the robustness against intentional attacks, compared to non-fractal
networks with the same degree exponent $\gamma$. In Fig. 9 the comparison between two such networks clearly shows that
the critical fraction $p_c$ increases almost 4 times from $p_c \simeq 0.02$ (non-fractal topology) to $p_c \simeq 0.09$ (fractal topology).
These networks have the same $\gamma$ exponent, the same number of links, number of nodes, number of loops and the same
clustering coefficient, differing only in whether hubs are directly connected to each other. The fractal property, thus,
provides a way of increasing resistance against the network collapse, in the case of a targeted attack.

The main reason behind this behavior is the dispersion of hubs in the network. A hub is usually a central node that
helps other nodes to connect to the main body of the system. When the hubs are directly connected to each other,
this central core is easy to destroy in a targeted attack, leading to a rapid collapse of the network. On the contrary,
isolating the hubs into different areas helps the network to retain connectivity for a longer time, since destroying the
hubs is then not similarly catastrophic, with most of the nodes finding alternative paths through other connections.

The advantage of increased robustness derived from the combination of modular and fractal network character may
provide valuable hints on why most biological networks have evolved towards a fractal architecture (better chance of
survival against lethal attacks).



FIG. 9: Vulnerability under intentional attack of a non-fractal Song-Makse-Havlin network (for e = 0) and a fractal
Song-Makse-Havlin network (for e = 1). The plot shows the relative size of the largest cluster, $S$, and the average size of the
remaining isolated clusters, $\langle s \rangle$, as a function of the removal fraction $f$ of the largest hubs for both networks.



Degree Correlations

We have already mentioned the importance of hub-hub correlations or anti-correlations in fractality. Generalizing
this idea to nodes of any degree, we can ask what is the joint degree probability $P(k_1, k_2)$ that a randomly chosen link
connects two nodes with degree $k_1$ and $k_2$, respectively. Obviously, this is a meaningful question only for networks
with a wide degree distribution, otherwise the answer is more or less trivial with all nodes having similar degrees. A
similar and perhaps more useful quantity is the conditional degree probability $P(k_1|k_2)$, defined as the probability
that a random link from a node having degree $k_2$ points to a node with degree $k_1$. In general, the following balance
condition is satisfied

$$k_2 P(k_1|k_2) P(k_2) = k_1 P(k_2|k_1) P(k_1) . \qquad (27)$$

It is quite straightforward to calculate $P(k_1|k_2)$ for completely uncorrelated networks. In this case, $P(k_1|k_2)$ does not
depend on $k_2$, and the probability to choose a node with degree $k_1$ becomes simply $P(k_1|k_2) = k_1 P(k_1)/\langle k \rangle$. In the
case where degree-degree correlations are present, though, the calculation of this function is very difficult, even when
restricting ourselves to a direct numerical evaluation, due to the emergence of huge fluctuations.

We can still estimate this function, though, using again the self-similarity principle. If we consider that the function
$P(k_1, k_2)$ remains invariant under the network renormalization scheme described above, then it is possible to show
that

$$P(k_1, k_2) \sim k_1^{-(\gamma - 1)} k_2^{-\epsilon} \qquad (k_1 > k_2) , \qquad (28)$$

and similarly

$$P(k_1|k_2) \sim k_1^{-(\gamma - 1)} k_2^{-(\epsilon - \gamma + 1)} \qquad (k_1 > k_2) . \qquad (29)$$

In the above equations we have also introduced the correlation exponent $\epsilon$, which characterizes the degree of correlations
in a network. For example, the case of uncorrelated networks is described by the value $\epsilon = \gamma - 1$.

The exponent $\epsilon$ can be measured quite accurately using an appropriate quantity. For this purpose, we can introduce
a measure such as

$$E_b(k) = \frac{\int_{bk}^{\infty} P(k|k_2)\, dk_2}{\int_{bk}^{\infty} P(k_2)\, dk_2} , \qquad (30)$$

which estimates the probability that a node with degree $k$ has neighbors with degree larger than $bk$, and $b$ is an
arbitrary parameter that has been shown not to influence the results. It is easy to show that

$$E_b(k) \sim \frac{k^{1 - \epsilon}}{k^{1 - \gamma}} = k^{-(\epsilon - \gamma)} . \qquad (31)$$

This relation allows us to estimate $\epsilon$ for a given network, after calculating the quantity $E_b(k)$ as a function of $k$.
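
One possible discrete estimator of $E_b(k)$ for a finite network is sketched below. It is an interpretation made for illustration, not necessarily the estimator used in the cited measurements: for nodes of degree $k$ it takes the fraction of their links that lead to neighbors of degree larger than $bk$, and divides it by the fraction of all nodes with degree larger than $bk$. The function name and the adjacency-list input are assumptions.

```python
from collections import Counter


def excess_hub_neighbors(adj, b=1.0):
    """Discrete estimate of E_b(k) for every degree k present in the network.

    Assumed estimator: (fraction of links leaving degree-k nodes that reach a
    neighbor of degree > b*k) / (fraction of all nodes with degree > b*k).
    Degrees for which the denominator vanishes are omitted.
    """
    degree = {node: len(nbs) for node, nbs in adj.items()}
    n_total = len(adj)
    link_ends = Counter()  # link ends attached to nodes of degree k
    big_ends = Counter()   # those that reach a neighbor of degree > b*k
    for node, nbs in adj.items():
        k = degree[node]
        for nb in nbs:
            link_ends[k] += 1
            if degree[nb] > b * k:
                big_ends[k] += 1
    result = {}
    for k in sorted(link_ends):
        frac_nodes = sum(1 for d in degree.values() if d > b * k) / n_total
        if frac_nodes > 0:
            result[k] = (big_ends[k] / link_ends[k]) / frac_nodes
    return result


# Toy example: a star with one hub and four leaves.
star = {0: [1, 2, 3, 4], 1: [0], 2: [0], 3: [0], 4: [0]}
print(excess_hub_neighbors(star, b=1.0))
```

On real data one would plot the returned values against $k$ on logarithmic axes and read off the slope $-(\epsilon - \gamma)$ suggested by Eq. (31).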
The above discussion applies equally to both fractal and non-fractal networks. If we restrict ourselves to
fractal networks, then we can develop our theory a bit further. If we consider the probability $\mathcal{E}(\ell_B)$ that the largest
degree node in each box is connected directly with the other largest degree nodes in other boxes (after optimally
covering the network), then this quantity scales as a power law with $\ell_B$:

$$\mathcal{E}(\ell_B) \sim \ell_B^{-d_e} , \qquad (32)$$

where d e is a new exponent describing the probability of hub- hub connection [Til ]. The exponent e, which describes 
correlations over any degree, is related to $d_e$, which refers to correlations between hubs only. The resulting relation is

$$\epsilon = 2 + \frac{d_e}{d_k} = 2 + (\gamma - 1)\frac{d_e}{d_B} . \qquad (33)$$

For an infinite fractal dimension $d_B \to \infty$, which is the onset of non-fractal networks that cannot be described by the
above arguments, we have the limiting case of $\epsilon = 2$. This value separates fractal from non-fractal networks, so that
fractality is indicated by $\epsilon > 2$. Also, we have seen that the line $\epsilon = \gamma - 1$ describes networks for which correlations
are minimal. Measurements of many real-life networks have verified the above statements: networks with $\epsilon > 2$
have been clearly characterized as fractals with alternate methods. All non-fractal networks have values of $\epsilon < 2$
and the distance from the $\epsilon = \gamma - 1$ line determines how much stronger or weaker the correlations are, compared to
the uncorrelated case.

In short, using the self-similarity principle makes it possible to gain a lot of insight into network correlations, a
notoriously difficult task otherwise. Furthermore, the study of correlations can be reduced to the calculation of a
single exponent $\epsilon$, which is nevertheless capable of delivering a wealth of information on the network topological properties.



Diffusion and Resistance 



Scale-free networks have been described as objects of infinite dimensionality. For a regular structure this statement
would suggest that one can simply use the known diffusion laws for $d \to \infty$. Diffusion on scale-free structures,
however, is much harder to study, mainly due to the lack of translational symmetry in the system and different local
environments. Although exact results are still not available, the scaling theory on fractal networks provides the tools
to better understand processes such as diffusion and electric resistance.

In the following, we describe diffusion through the average first-passage time $T_{AB}$, which is the average time for a
diffusing particle to travel from node A to node B. At the same time, assuming that each link in the network has an
electrical resistance of 1 unit, we can describe the electrical properties through the resistance between the two nodes
A and B, $R_{AB}$.

The connection between diffusion (first-passage time) and electric networks has long been established in homogeneous
systems. This connection is usually expressed through the Einstein relation [16]. The Einstein relation is
of great importance because it connects a static quantity $R_{AB}$ with a dynamic quantity $T_{AB}$. In other words, the
behavior of a diffusing particle can be inferred by simply having knowledge of a static topological property of the
network.

In any renormalizable network the scaling of $T$ and $R$ follows the form:

$$T'/T = \ell_B^{-d_w} , \qquad R'/R = \ell_B^{-\zeta} , \qquad (34)$$

where $T'$ ($R'$) and $T$ ($R$) are the first-passage time (resistance) for the renormalized and original networks, respectively.
The dynamical exponents $d_w$ and $\zeta$ characterize the scaling in any lattice or network that remains invariant under
renormalization. The Einstein relation relates these two exponents through the dimensionality of the substrate $d_B$,
according to:

$$d_w = \zeta + d_B . \qquad (35)$$

The validity of this relation in inhomogeneous complex networks, however, is not yet clear. Still, in fractal and 
transfractal networks there are many cases where this relation has been proved to be valid, hinting towards a wider 
applicability. For example, it has been shown that the Einstein relation [16] in $(u, v)$-flowers and
$(u, v)$-trees is valid for any $u$ and $v$, that is for both fractal and transfractal networks. In general, in terms of the
scaling theory we can study diffusion and resistance (or conductance) in a similar manner [48].

FIG. 10: Typical behavior of the probability distributions for the resistance $R$ vs $R'$ and the diffusion time $T$ vs $T'$, respectively,
for a given $\ell_B$ value. Similar plots for other $\ell_B$ values verify that the ratios of these quantities during a renormalization stage
are roughly constant for all pairs of nodes in a given biological network.

FIG. 11: Average value of the ratio of resistances $R/R'$ and diffusion times $T/T'$, as measured for different $\ell_B$ values (each
point corresponds to a different value of $\ell_B$). Results are presented for both biological networks, and two fractal network models
with different $d_M$ values. The slopes of the curves correspond to the exponents $\zeta/d_B$ (top panel) and $d_w/d_B$ (bottom panel).

Because of the highly inhomogeneous character of the structure, though, we are interested in how these quantities
behave as a function of the end-node degrees $k_1$ and $k_2$ when they are separated by a given distance $\ell$. Thus, we are
looking for the full dependence of $T(\ell; k_1, k_2)$ and $R(\ell; k_1, k_2)$. Obviously, for lattices or networks with narrow degree
distribution there is no degree dependence and those results should be a function of $\ell$ only.
For self-similar networks, we can rewrite Eq. (34) above as

$$\frac{T'}{T} = \left(\frac{N'}{N}\right)^{d_w/d_B} , \qquad \frac{R'}{R} = \left(\frac{N'}{N}\right)^{\zeta/d_B} , \qquad (36)$$

where we have taken into account Eq. (3). This approach offers the practical advantage that the variation of $N'/N$ is
larger than the variation of $\ell_B$, so that the calculation of the exponents can be more accurate. To calculate these exponents,
we fix the box size $\ell_B$ and we measure the diffusion time $T$ and resistance $R$ between any two points in a network
before and after renormalization. If for every such pair we plot the corresponding times and resistances in $T'$ vs $T$
and $R'$ vs $R$ plots, as shown in Fig. 10, then all these points fall in a narrow area, suggesting a constant value for the
ratio $T'/T$ over the entire network. Repeating this procedure for different $\ell_B$ values yields other ratio values. The
plot of these ratios vs $N'/N$ (Fig. 11) finally exhibits a power-law dependence, verifying Eq. (36). We can then easily
calculate the exponents $d_w$ and $\zeta$ from the slopes in the plot, since the $d_B$ exponent is already known through the
standard box-covering methods. It has been shown that the results for many different networks are consistent, within
statistical error, with the Einstein relation [48, 50].
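
The slope extraction described above amounts to a least-squares fit on logarithmic axes. The sketch below illustrates it on synthetic data only: the exponent values are hypothetical and the ratios are generated so that they obey Eq. (36) exactly, so the fit simply recovers the inputs. The function name `loglog_slope` is illustrative.

```python
import math


def loglog_slope(xs, ys):
    """Least-squares slope of log(y) against log(x)."""
    lx = [math.log(x) for x in xs]
    ly = [math.log(y) for y in ys]
    mean_x = sum(lx) / len(lx)
    mean_y = sum(ly) / len(ly)
    sxx = sum((a - mean_x) ** 2 for a in lx)
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(lx, ly))
    return sxy / sxx


# Synthetic (not measured) data with hypothetical exponents.
d_B, zeta = 2.0, 1.0
d_w = zeta + d_B                      # Einstein relation, Eq. (35)
box_sizes = [2, 3, 4, 5, 6]
mass_ratio = [l ** (-d_B) for l in box_sizes]            # N'/N, from Eq. (3)
time_ratio = [r ** (d_w / d_B) for r in mass_ratio]      # T'/T, Eq. (36)
resistance_ratio = [r ** (zeta / d_B) for r in mass_ratio]  # R'/R, Eq. (36)

# The slopes are d_w/d_B and zeta/d_B; multiply by the known d_B.
d_w_fit = d_B * loglog_slope(mass_ratio, time_ratio)
zeta_fit = d_B * loglog_slope(mass_ratio, resistance_ratio)
print(round(d_w_fit, 6), round(zeta_fit, 6), round(d_w_fit - zeta_fit - d_B, 6))
```

With measured ratios, the last printed number would indicate how closely the Einstein relation is satisfied for the network under study.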

FIG. 12: Rescaling of (a) the resistance and (b) the diffusion time according to Eqs. (41) and (42) for the protein interaction
network of yeast (upper symbols) and the Song-Havlin-Makse model for e = 1 (lower filled symbols). The data for PIN have
been vertically shifted upwards by one decade for clarity. Each symbol corresponds to a fixed ratio $k_1/k_2$ and the different colors
denote a different value for $k_2$. Inset: Resistance $R$ as a function of distance $\ell$, before rescaling, for constant ratio $k_1/k_2 = 1$
and different $k_1$ values.

The dependence on the degrees $k_1$, $k_2$ and the distance $\ell$ can also be calculated in a scaling form using the
self-similarity properties of fractal networks. After renormalization, a node with degree $k$ in a given network will have
a degree $k' = \ell_B^{-d_k} k$ according to Eq. (9). At the same time all distances $\ell$ are scaled down according to $\ell' = \ell/\ell_B$.
This means that Eqs. (36) can be written as

$$R'(\ell'; k_1', k_2') = \ell_B^{-\zeta} R(\ell; k_1, k_2) , \qquad (37)$$

$$T'(\ell'; k_1', k_2') = \ell_B^{-d_w} T(\ell; k_1, k_2) . \qquad (38)$$

Substituting the renormalized quantities we get:

$$R'(\ell_B^{-1} \ell; \ell_B^{-d_k} k_1, \ell_B^{-d_k} k_2) = \ell_B^{-\zeta} R(\ell; k_1, k_2) . \qquad (39)$$

The above equation holds for all values of $\ell_B$, so we can select this quantity to be $\ell_B = k_2^{1/d_k}$. This constraint allows
us to reduce the number of variables in the equation, with the final result:

$$R'\left(\frac{\ell}{k_2^{1/d_k}}; \frac{k_1}{k_2}, 1\right) = k_2^{-\zeta/d_k} R(\ell; k_1, k_2) . \qquad (40)$$

This equation suggests a scaling for the resistance $R$:

$$R(\ell; k_1, k_2) = k_2^{\zeta/d_k} f_R\left(\frac{\ell}{k_2^{1/d_k}}, \frac{k_1}{k_2}\right) , \qquad (41)$$

where $f_R()$ is an undetermined function. All the above arguments can be repeated for the diffusion time, with a
similar expression:

$$T(\ell; k_1, k_2) = k_2^{d_w/d_k} f_T\left(\frac{\ell}{k_2^{1/d_k}}, \frac{k_1}{k_2}\right) , \qquad (42)$$

where the form of the right-hand function may be different. The final result for the scaling form is given by Eqs. (41) and
(42), which is also supported by the numerical data collapse in Fig. 12. Notice that in the case of homogeneous
networks, where there is almost no $k$-dependence, the unknown functions in the rhs reduce to the forms $f_R(x, 1) = x^{\zeta}$,
$f_T(x, 1) = x^{d_w}$, leading to the well-established classical relations $R \sim \ell^{\zeta}$ and $T \sim \ell^{d_w}$.



Future Directions 



Fractal networks combine features met in fractal geometry and in network theory. As such, they present many
unique aspects. Many of their properties have been well studied and understood, but a great number of
open and unexplored questions remain.

Concerning the structural aspects of fractal networks, we have described that in most networks the degree
distribution $P(k)$, the joint degree distribution $P(k_1, k_2)$ and a number of other quantities remain invariant under
renormalization. Are there any quantities that are not invariant, and what would their importance be?

Of central importance is the relation of topological features with functionality. The optimal network covering leads
to the partitioning of the network into boxes. Do these boxes carry a message other than node proximity? For
example, the boxes could be used as an alternative definition for separated communities, and fractal methods could
be used as a novel method for community detection in networks.

The networks that we have presented are all static, with no temporal component, and time evolution has been
ignored in all our discussions above. Clearly, biological networks, the WWW, and other networks have grown (and
continue to grow) from some earlier simpler state to their present fractal form. Has fractality always been there or
has it emerged as an intermediate stage obeying certain evolutionary driving forces? Is fractality a stable condition, or
will growing networks eventually fall into a non-fractal form?

Finally, we want to know what is the inherent reason behind fractality. Of course, we have already described how
hub-hub anti-correlations can give rise to fractal networks. However, can this be directly related to some underlying
mechanism, so that we gain some information on the process? In general, in biology we already have some idea of
the advantages of adopting a fractal structure. Still, the question remains: why does fractality exist in certain networks
and not in others? Why are both fractal and non-fractal networks needed? It seems that we will be able to increase
our knowledge of the network evolutionary mechanisms through fractality studies.

In conclusion, a deeper understanding of the self-similarity, fractality and transfractality of complex networks will 
help us analyze and better understand many fundamental properties of real-world networks. 



APPENDIX: THE BOX COVERING ALGORITHMS 



The estimation of the fractal dimension and of the self-similar features of networks has become standard practice
in the study of real-world systems. For this reason, in the last three years many box covering algorithms have been
proposed 2J,|63(. This section presents four of the main algorithms, along with a brief discussion on the advantages 
and disadvantages that they offer. 

Recalling the original definition of box covering by Hausdorff EE EH , for a given network G and box size £b, 
a box is a set of nodes where all distances $\ell_{ij}$ between any two nodes $i$ and $j$ in the box are smaller than $\ell_B$. The
minimum number of boxes required to cover the entire network $G$ is denoted by $N_B$. For $\ell_B = 1$, each box encloses
only 1 node and therefore, $N_B$ is equal to the size of the network $N$. On the other hand, $N_B = 1$ for $\ell_B \geq \ell_B^{\max}$,
where $\ell_B^{\max}$ is the diameter of the network plus one.

The ultimate goal of a box-covering algorithm is to find the minimum number of boxes $N_B(\ell_B)$ for any $\ell_B$. It has
been shown that this problem belongs to the family of NP-hard problems [65], which means that no algorithm is known
that solves it in polynomial time. In other words, for a relatively large network size, no known algorithm can
provide an exact solution in a reasonably short amount of time. This limitation requires treating the box covering
problem with approximations, using for example optimization algorithms.



The greedy coloring algorithm 

The box-covering problem can be mapped into another NP-hard problem [65]: the graph coloring problem.

An algorithm that approximates the optimal solution of this problem well has been presented previously. For an arbitrary
value of $\ell_B$, first construct a dual network $G'$, in which two nodes are connected if the distance between them in $G$
(the original network) is greater than or equal to $\ell_B$. Fig. 13 shows an example of a network $G$ which yields such a dual
network $G'$ for $\ell_B = 3$ (upper row of Fig. 13).

Vertex coloring is a well-known procedure, where labels (or colors) are assigned to each vertex of a network, so
that no edge connects two identically colored vertices. It is clear that such a coloring in $G'$ gives rise to a natural
box covering in the original network $G$, in the sense that vertices of the same color will necessarily form a box since
the distance between them must be less than $\ell_B$. Accordingly, the minimum number of boxes $N_B(G)$ is equal to the
minimum required number of colors (or the chromatic number) in the dual network $G'$, $\chi(G')$.

In simpler terms, (a) if the distance between two nodes in $G$ is greater than or equal to $\ell_B$, these two nodes cannot belong
in the same box. According to the construction of $G'$, these two nodes will be connected in $G'$ and thus they cannot
have the same color. Since they have a different color they will not belong in the same box in $G$. (b) On the contrary,
if the distance between two nodes in $G$ is less than $\ell_B$ it is possible that these nodes belong in the same box. In $G'$
these two nodes will not be connected and it is allowed for these two nodes to carry the same color, i.e. they may
belong to the same box in $G$ (whether these nodes will actually share a box depends on the exact implementation
of the coloring algorithm).

FIG. 13: Illustration of the solution for the network covering problem via mapping to the graph coloring problem. Starting
from $G$ (upper left panel) we construct the dual network $G'$ (upper right panel) for a given box size (here $\ell_B = 3$), where two
nodes are connected if they are at a distance $\ell \geq \ell_B$. We use a greedy algorithm for vertex coloring in $G'$, which is then used
to determine the box covering in $G$, as shown in the plot.

The algorithm that follows both constructs the dual network $G'$ and assigns the proper node colors for all $\ell_B$ values
in one go. For this implementation a two-dimensional matrix $c_{i\ell}$ of size $N \times \ell_B^{\max}$ is needed, whose values represent
the color of node $i$ for a given box size $\ell = \ell_B$.

1. Assign a unique id from 1 to N to all network nodes, without assigning any colors yet. 

2. For all $\ell_B$ values, assign the color value 0 to the node with id=1, i.e. $c_{1\ell} = 0$.

3. Set the id value $i = 2$. Repeat the following until $i = N$.

(a) Calculate the distance $\ell_{ij}$ from $i$ to all the nodes in the network with id $j$ less than $i$.

(b) Set $\ell_B = 1$.

(c) Select one of the colors not used by any node $j < i$ for which $\ell_{ij} \geq \ell_B$, i.e. a color different from every
such $c_{j\ell_B}$. This is the color $c_{i\ell_B}$ of node $i$ for the given $\ell_B$ value.

(d) Increase $\ell_B$ by one and repeat (c) until $\ell_B = \ell_B^{\max}$.

(e) Increase i by 1. 
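The steps above can be sketched in a few lines of Python. This is only an illustrative sketch, not the authors' implementation: it assumes a connected network stored as an adjacency dictionary, uses the dictionary order as the node ids, and always picks the smallest free color in step (c). The names `bfs_distances` and `greedy_coloring_boxes` are hypothetical.

```python
from collections import deque

def bfs_distances(adj, source):
    """Shortest-path lengths (in links) from source to every reachable node."""
    dist = {source: 0}
    queue = deque([source])
    while queue:
        u = queue.popleft()
        for v in adj[u]:
            if v not in dist:
                dist[v] = dist[u] + 1
                queue.append(v)
    return dist

def greedy_coloring_boxes(adj, lb_max):
    """Return {lb: number of boxes} for lb = 1..lb_max (connected graph assumed)."""
    nodes = list(adj)                               # step 1: list order acts as the id
    # color[node][lb] is the color of node for box size lb (index 0 is unused)
    color = {nodes[0]: [0] * (lb_max + 1)}          # step 2: first node gets color 0
    for i, node in enumerate(nodes[1:], start=1):   # step 3
        dist = bfs_distances(adj, node)             # step 3(a)
        earlier = nodes[:i]
        color[node] = [0] * (lb_max + 1)
        for lb in range(1, lb_max + 1):             # steps 3(b) and 3(d)
            # colors held by earlier nodes that are too far away to share a box
            forbidden = {color[j][lb] for j in earlier if dist[j] >= lb}
            c = 0
            while c in forbidden:                   # step 3(c): smallest free color
                c += 1
            color[node][lb] = c
    return {lb: len({color[n][lb] for n in nodes}) for lb in range(1, lb_max + 1)}
```

Reshuffling the order of `nodes` before the loop gives a different coloring sequence, which is how the sensitivity to the sequence can be probed.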

The results of the greedy algorithm may depend on the original coloring sequence. The quality of this algorithm
was investigated by randomly reshuffling the coloring sequence and applying the greedy algorithm several times and
in different models [24]. The result was that the probability distribution of the number of boxes $N_B$ (for all box
sizes $\ell_B$) is a narrow Gaussian distribution, which indicates that almost any implementation of the algorithm yields
a solution close to the optimal.

Strictly speaking, the calculation of the fractal dimension $d_B$ through the relation $N_B \sim \ell_B^{-d_B}$ is valid only for the
minimum possible value of $N_B$, for any given $\ell_B$ value, so any box covering algorithm must aim to find this minimum
$N_B$. Although there is no rule to determine when this minimum value has actually been reached (since this would
require an exact solution of the NP-hard coloring problem), it has been shown [60] that the greedy coloring algorithm
can, in many cases, identify a coloring sequence which yields the optimal solution.
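Once a box count is available for several box sizes, the exponent in $N_B \sim \ell_B^{-d_B}$ can be estimated from the slope of $\log N_B$ against $\log \ell_B$. The sketch below is one simple way to do this with an ordinary least-squares fit; the function name `fractal_dimension` is hypothetical, and in practice the fit would have to be restricted to the range of $\ell_B$ where the power law actually holds.

```python
import math

def fractal_dimension(box_counts):
    """Estimate d_B from {l_B: N_B} as minus the least-squares log-log slope."""
    xs = [math.log(lb) for lb in box_counts]
    ys = [math.log(nb) for nb in box_counts.values()]
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    covariance = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    variance = sum((x - mean_x) ** 2 for x in xs)
    return -covariance / variance

# Synthetic (made-up) counts that follow N_B = 1000 * l_B^(-2) exactly
synthetic = {lb: 1000.0 * lb ** -2.0 for lb in (2, 4, 8, 16)}
estimate = fractal_dimension(synthetic)
```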

Burning algorithms 

This section presents three box covering algorithms based on the more traditional breadth-first search algorithm.

A box is defined as compact when it includes the maximum possible number of nodes, i.e. when there do not exist
any other network nodes that could be included in this box. A connected box means that any node in the box can be
reached from any other node in this box, without having to leave this box. Conversely, a disconnected box denotes
a box where certain nodes can be reached by other nodes in the box only by visiting nodes outside this box. For a
demonstration of these definitions see Fig. 14.

FIG. 14: Our definitions for a box that is (a) non-compact for $\ell_B = 3$, i.e. could include more nodes, (b) compact, (c) connected,
and (d) disconnected (the nodes in the right box are not connected in the box). (e) For this box, the values $\ell_B = 5$ and $r_B = 2$
verify the relation $\ell_B = 2r_B + 1$. (f) One of the pathological cases where this relation is not valid, since $\ell_B = 3$ and $r_B = 2$.

FIG. 15: Illustration of the CBB algorithm for $\ell_B = 3$. (a) Initially, all nodes are candidates for the box. (b) A random node is
chosen, and nodes at a distance of $\ell_B$ or more from this node are no longer candidates. (c) The node chosen in (b) becomes
part of the box and another candidate node is chosen. The above process is then repeated until the box is complete.
Legend: belongs to box; just selected; candidate; out of box.



Burning with the diameter $\ell_B$, and the Compact-Box-Burning (CBB) algorithm

The basic idea of the CBB algorithm for the generation of a box is to start from a given box center and then expand
the box so that it includes the maximum possible number of nodes, while keeping the distance between any two
nodes in the box below $\ell_B$. The CBB algorithm is as follows (see Fig. 15):

1. Initially, mark all nodes as uncovered. 

2. Construct the set C of all yet uncovered nodes. 

3. Choose a random node p from the candidate set C, add it to the current box, and remove it from C.

4. Remove from C all nodes $i$ whose distance from p is $\ell_{pi} \geq \ell_B$, since by definition they will not belong in the
same box.

5. Repeat steps (3) and (4) until the candidate set is empty. 

6. Repeat from step (2) until all the network has been covered. 
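A minimal Python sketch of the CBB steps listed above follows. It is an illustration under stated assumptions rather than the reference implementation: the network is an adjacency dictionary with sortable node labels, distances are measured in the full network G (so a box may be disconnected), and the names `bfs_distances` and `compact_box_burning` are hypothetical.

```python
import random
from collections import deque

def bfs_distances(adj, source):
    """Shortest-path lengths (in links) from source to every reachable node."""
    dist = {source: 0}
    queue = deque([source])
    while queue:
        u = queue.popleft()
        for v in adj[u]:
            if v not in dist:
                dist[v] = dist[u] + 1
                queue.append(v)
    return dist

def compact_box_burning(adj, lb, rng=random):
    """Cover the network with boxes in which every pairwise distance is < lb."""
    uncovered = set(adj)                           # step 1
    boxes = []
    while uncovered:                               # step 6
        candidates = set(uncovered)                # step 2
        box = []
        while candidates:                          # step 5
            # step 3: random candidate (sorted only to make seeded runs repeatable)
            p = rng.choice(sorted(candidates))
            candidates.discard(p)
            box.append(p)
            dist = bfs_distances(adj, p)
            # step 4: keep only candidates closer than lb to p
            candidates = {i for i in candidates
                          if dist.get(i, float("inf")) < lb}
        uncovered.difference_update(box)
        boxes.append(box)
    return boxes
```

Because every surviving candidate is within distance less than `lb` of all nodes already chosen, the box stays valid at each step and cannot be extended once the candidate set is empty, which is the compactness property described above.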



Random Box Burning 

In 2006, J. S. Kim et al. presented a simple algorithm for the calculation of fractal dimension in networks [67, 68, 69]:

1. Pick a randomly chosen node in the network as a seed of the box.

2. Search using the breadth-first search algorithm until distance $\ell_B$ from the seed. Assign all newly burned nodes to the
new box. If no new node is found, discard and start from (1) again.

3. Repeat (1) and (2) until all nodes have a box assigned.

FIG. 16: Burning with the radius $r_B$ from (a) a hub node or (b) a non-hub node results in very different network coverage. In
(a) we need just one box of $r_B = 1$ while in (b) 5 boxes are needed to cover the same network. This is an intrinsic problem
when burning with the radius. (c) Burning with the maximum distance $\ell_B$ (in this case $\ell_B = 2r_B + 1 = 3$) we avoid this
situation, since independently of the starting point we would still obtain $N_B = 1$.

This Random Box Burning algorithm has the advantage of being a fast and simple method. However, no inherent
optimization is employed during the network coverage. Thus, this simple Monte-Carlo method is almost certain to
yield a solution far from the optimal, and one needs to run many different realizations and retain only the smallest
number of boxes found among them.
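The procedure, including the repetition over many realizations, might be sketched as follows. This is a hedged illustration, not the code of Kim et al.: it reads "until distance $\ell_B$ from the seed" as "all nodes at distance at most `lb`", lets any node (burned or not) be drawn as a seed, and uses the hypothetical names `bfs_within`, `random_box_burning` and `best_of`.

```python
import random
from collections import deque

def bfs_within(adj, source, radius):
    """Distances from source to all nodes at distance <= radius."""
    dist = {source: 0}
    queue = deque([source])
    while queue:
        u = queue.popleft()
        if dist[u] == radius:        # do not expand beyond the burning distance
            continue
        for v in adj[u]:
            if v not in dist:
                dist[v] = dist[u] + 1
                queue.append(v)
    return dist

def random_box_burning(adj, lb, rng=random):
    """One realization; returns the number of boxes used."""
    nodes = sorted(adj)
    unassigned = set(adj)
    n_boxes = 0
    while unassigned:                                # step 3
        seed = rng.choice(nodes)                     # step 1
        burned = bfs_within(adj, seed, lb)           # step 2
        newly_burned = unassigned.intersection(burned)
        if newly_burned:                             # otherwise discard this seed
            unassigned -= newly_burned
            n_boxes += 1
    return n_boxes

def best_of(adj, lb, realizations, rng=random):
    """Smallest box count found over several independent realizations."""
    return min(random_box_burning(adj, lb, rng) for _ in range(realizations))
```

On a star-shaped network the count from a single realization depends strongly on whether the hub happens to be drawn first, which illustrates why only the minimum over many runs is retained.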

Burning with the radius $r_B$, and the Maximum-Excluded-Mass-Burning (MEMB) algorithm

A box of size $\ell_B$ includes nodes where the distance between any pair of nodes is less than $\ell_B$. It is possible, though,
to grow a box from a given central node, so that all nodes in the box are at a distance of at most $r_B$ from it, where
$r_B$ is a given box radius (the maximum distance from the central node). This way, one can still recover the same
fractal properties of a network. For the original definition of the box, $\ell_B$ corresponds to the box diameter (maximum
distance between any two nodes in the box) plus one. Thus, $\ell_B$ and $r_B$ are connected through the simple relation
$\ell_B = 2r_B + 1$. This relation is exact for loopless configurations, but in general there may exist cases where it does
not hold (Fig. 14).

The MEMB algorithm always yields the optimal solution for non-scale-free homogeneous networks, since the choice
of the central node is not important. However, in inhomogeneous networks with a wide-tailed degree distribution, such
as scale-free networks, this algorithm fails to achieve an optimal solution because of the presence of hubs.

Unlike the Random Box Burning and the CBB, the MEMB algorithm attempts to locate some optimal central
nodes which act as the burning origins for the boxes. It contains as a special case the choice of the hubs as centers of
the boxes, but it also allows low-degree nodes to be burning centers, which is sometimes convenient for finding a
solution closer to the optimal.

In the following algorithm we use the basic idea of box optimization, in which each box covers the maximum possible
number of nodes. For a given burning radius $r_B$, we define the excluded mass of a node as the number of uncovered
nodes within a chemical distance of at most $r_B$. First, calculate the excluded mass for all the uncovered nodes. Then,
seek to cover the network with boxes of maximum excluded mass. The details of this algorithm are as follows (see
Fig. 17):

1. Initially, all nodes are marked as uncovered and non-centers. 

2. For all non-center nodes (including the already covered nodes) calculate the excluded mass, and select the node 
p with the maximum excluded mass as the next center. 

3. Mark all the nodes with chemical distance at most $r_B$ from p as covered.

4. Repeat steps (2) and (3) until all nodes are either covered or centers. 

Notice that the excluded mass has to be recalculated in each step, because covering nodes in the previous step may
have changed it. A box center can also be an already covered node, since it may lead to a larger box mass. After the above
procedure, the number of selected centers coincides with the number of boxes $N_B$ that completely cover the network.
However, the non-center nodes have not yet been assigned to a given box. This is done in a second stage:

1. Give a unique box id to every center node.

2. For all nodes calculate the "central distance", which is the chemical distance to the nearest center. The central
distance cannot exceed $r_B$, and the center identification algorithm above guarantees that there will always
exist such a center. Obviously, all center nodes have a central distance equal to 0.

3. Sort the non-center nodes in a list according to increasing central distance.

4. For each non-center node $i$, at least one of its neighbors has a central distance less than its own. Assign to $i$ the
same id as this neighbor. If there exist several such neighbors, randomly select an id from these neighbors.
Remove $i$ from the list.

5. Repeat step (4) according to the sequence from the list in step (3) for all non-center nodes.

FIG. 17: Illustration of the MEMB algorithm for $r_B = 1$. Upper row: Calculation of the box centers. (a) We calculate the
excluded mass for each node. (b) The node with maximum mass becomes a center and the excluded masses are recalculated.
(c) A new center is chosen. Now, the entire network is covered with these two centers. Bottom row: Calculation of the boxes.
(d) Each box includes initially only the center. Starting from the centers we calculate the distance of each network node to the
closest center. (e) We assign each node to its nearest box.
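Both stages of MEMB, the selection of centers and the assignment of the remaining nodes to boxes, can be sketched together as below. This is an illustrative sketch only: it assumes an adjacency dictionary, treats a node as covered when it lies at distance at most `rb` from a center (the reading consistent with $\ell_B = 2r_B + 1$), breaks ties in excluded mass by dictionary order, and uses the hypothetical names `bfs_within` and `memb`.

```python
import random
from collections import deque

def bfs_within(adj, source, radius):
    """Distances from source to all nodes at distance <= radius."""
    dist = {source: 0}
    queue = deque([source])
    while queue:
        u = queue.popleft()
        if dist[u] == radius:
            continue
        for v in adj[u]:
            if v not in dist:
                dist[v] = dist[u] + 1
                queue.append(v)
    return dist

def memb(adj, rb, rng=random):
    """Return {node: box id}, where the box id is the label of the box center."""
    ball = {u: set(bfs_within(adj, u, rb)) for u in adj}
    uncovered, centers, is_center = set(adj), [], set()
    # First stage: choose centers by maximum excluded mass
    while uncovered:
        # excluded mass of every non-center node, recomputed each round
        mass = {u: len(ball[u] & uncovered) for u in adj if u not in is_center}
        p = max(mass, key=mass.get)
        centers.append(p)
        is_center.add(p)
        uncovered -= ball[p]                      # mark the ball around p as covered
    # Second stage, step 2: central distance via a multi-source BFS
    central = {c: 0 for c in centers}
    queue = deque(centers)
    while queue:
        u = queue.popleft()
        for v in adj[u]:
            if v not in central:
                central[v] = central[u] + 1
                queue.append(v)
    box = {c: c for c in centers}                 # step 1: one id per center
    others = sorted((u for u in adj if u not in is_center), key=central.get)  # step 3
    for i in others:                              # steps 4 and 5
        closer = [v for v in adj[i] if central[v] < central[i]]
        box[i] = box[rng.choice(closer)]          # inherit the id of a closer neighbor
    return box
```

Since each node inherits its id from a neighbor that is strictly closer to a center, every box is connected by construction, which is the property that makes MEMB the natural choice when connected boxes are required.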



Comparison between algorithms 

The choice of the algorithm to be used for a problem depends on the details of the problem itself. If connected 
boxes are a requirement, MEMB is the most appropriate algorithm; but if one is only interested in obtaining the 
fractal dimension of a network, the greedy-coloring or the random box burning are more suitable since they are the 
fastest algorithms. 

As explained previously, any algorithm should aim to find the optimal solution, that is, the minimum number
of boxes that cover the network. Fig. 18 shows the performance of each algorithm. The greedy-coloring, the CBB
and MEMB algorithms exhibit a narrow distribution of the number of boxes, showing evidence that they cover the
network with a number of boxes that is close to the optimal solution. Instead, the Random Box Burning returns a
wider distribution and its average is far above the average of the other algorithms. Because of the great ease and
speed with which this technique can be implemented, it would be useful to show that the average number of covering
boxes is overestimated by a fixed proportionality constant. In that case, despite the error, the predicted number of
boxes would still yield the correct scaling and fractal dimension.
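The last point can be made explicit with a one-line calculation. Suppose, purely as a hypothesis, that the Random Box Burning count exceeds the optimal count by a constant factor $A > 1$ that does not depend on $\ell_B$ (the symbols $A$ and $N_B^{\mathrm{RBB}}$ are introduced here only for illustration):

```latex
N_B^{\mathrm{RBB}}(\ell_B) = A \, N_B(\ell_B) \sim A \, \ell_B^{-d_B}
\quad \Longrightarrow \quad
\log N_B^{\mathrm{RBB}} = \log A - d_B \log \ell_B .
```

The factor $A$ only shifts the log-log curve vertically, so its slope, and hence the estimated $d_B$, would be unchanged. If instead the overestimate varied with $\ell_B$, the slope would be biased.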
FIG. 18: Comparison of the distribution of $N_B$ for $10^4$ realizations of the four network covering methods presented in this
paper. Notice that three of these methods yield very similar results with narrow distributions and comparable minimum values,
while the random burning algorithm fails to reach a value close to this minimum (and yields a broad distribution).